RANDOM WALKS ON THE TORUS WITH SEVERAL 

GENERATORS 

TIMOTHY PRESCOTT* AND FRANCIS EDWARD SU** 

Abstract. Given n vectors {di}2^i e [0, 1)'', consider a random walk on the d- 
dimensional torus T"* = R'^/Z'^ generated by these vectors by successive addition 
and subtraction. For certain sets of vectors, this walk converges to Haar (uni- 
form) measure on the torus. We show that the discrepancy distance D{Q*^) be- 
tween the fc-th step distribution of the walk and Haar measure is bounded below 
by D{Q*^) > CiA;~"/^, where Ci — C{n, d) is a constant. If the vectors are badly 
approximated by rationals (in a sense we will define) then D{Q*^) < C2k~"^'^'^ 
for C2 = C(n, d, dj) a constant. 



Let T*^ = /U^ denote the (i- dimensional torus. As a quotient group of it 
is an additive group, so the group elements may be viewed as elements of [0, l)'', 
with the group operation defined as coordinate- wise addition mod 1. 

Let ai, . . . , On be vectors in T'^ = [0, l)'', and consider the random walk on 
the ci-dimensional torus T'^ that proceeds as follows. Start at 0. At each step, 
choose one the vectors cJj with probability 1/n and add or subtract that vector 
(with probability 1/2) to the current position to get to the next position in the 
walk. 

As a random walk on a group, the fc-th step distribution of the walk converges to 
a limiting distribution [Hj, and in many cases this will be Haar measure, the unique 
translation-invariant measure on the group. For the torus T'^, Haar measure may 
be thought of as the uniform distribution on the "flat" cube [0, l)*^, since addition 
corresponds to translation on /WJ^. We shall prove bounds for how quickly this 
random walk approaches Haar measure on the torus. 
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We first note that for certain sets of vectors, this walk may not converge to 
Haar measure. For instance, if all the entries of each di are rational, then the 
random walk will not converge to Haar measure, but will converge to a limiting 
distribution supported on a discrete subgroup of T''. As another example, if there 
is only one generator di = {x,x, ...,x) for some irrational x, then the walk will be 
supported on a circle along the "diagonal" of the torus. (However, a single vector 
can generate a walk that does converge to Haar measure, provided it is chosen 
well.) 

Let Q denote the generating measure for this random walk, i.e., if = U"^^{+aj, — 
is the set of generators of the random walk, then for a set -B C T'^, let Q{B) = 
\B n 51 /1 51 where | ■ | denotes the size of a finite set. The k-th step probability 
distribution is then given by the k-th convolution power of Q, which we denote by 
Q*'^. Let U denote Haar measure. 

As a measure of distance between the probability distributions Q*'' and U, we will 
use the discrepancy metric, which is defined to be the supremum of the difference 
of two probability measures over all "boxes" in T"^ = R'^/Z'^ with sides parallel to 
the axes in Mf^, i.e., of the form [ai, bi) x [a2, 62) x ... x [a^, bfi). Let D{Q*^) denote 
the discrepancy of Q*^ from Haar measure U: 

D{Q*''):= sup \Q*'{B)-U{B)\. 

boxBCT'' 

The discrepancy metric has been used by number theorists to study the uniform 
distribution of sequences mod 1, e.g., see [3 E]. Diaconis Ij suggested its use 
for the study of rates of convergence for random walks on groups. It admits 
Fourier bounds j3] and has many other nice properties and connections with other 
probability metrics jS]. 

Although the total variation metric is more commonly used to study the conver- 
gence of random walks, we do not use it here because this random walk does not 
converge in total variation (in fact, the total variation distance between Q*^ and 
U is always 1, since at any step Q*^ is supported on a finite set). The possibility of 
using Fourier analysis to bound the discrepancy distance makes it a more desirable 
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choice than other common metrics on probabihties, such as the Prohorov metric, 
and has allowed many recent results for the study of discrete random walks on 
continuous state spaces (e.g., [121 IE!)- Most of the literature for rates of conver- 
gence of random walks have been limited to walks on finite groups or state spaces, 
and those that have focused on infinite compact groups (e.g., [Hj, 0, [IB]) have 
studied walks generated by continuous measures. By contrast, the walk we study 
is generated by a discrete set of generators on an infinite group. 
We prove: 

Theorem 1. Let Q denote the generating measure of the the random walk on the 
d-torus generated by n vectors «!,...,«„. Then the k-th step probability distribution 
Q*^ satisfies: 

This result holds for any set of vector generators. On the other hand, for certain 
sets of badly approximable generators (to be defined later), we can establish the 
following upper bound. 

Theorem 2. Let Anxd be a badly approximable matrix, with rows ai,...,a„, and 
approximation constant Ca- If Q is the generating measure of the random walk 
on the d-torus generated by the on, then the k-th step probability distribution Q*^ 
satisfies: 

d / \ n/d 



We note that the case d = 1 corresponds to a random walk on the circle, which 
has been studied for a single generator and for several generators [1]. 

1. Lower Bound 

The following notation will be used throughout this paper: 
||x||: the Euclidean (L^) norm of a vector x 
||x||oo: the supremum norm of a vector x 

{x}: the Euclidean (L^) distance from x to the nearest integral point 
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{x}oo: the supremum distance from x to the nearest integral point 
To estabhsh a lower bound for the discrepancy, we use a lemma due to Dirichlet: 

Lemma 3 (Dirichlet 1842). Given any real n x d matrix A and q > I, there is 
some h G Z"' such that < ||h||oo < g"/'^ and {Ah}^ < 1/q. 

A simple proof using a pigeonhole argument may be found in ^T] . We now prove 
Theorem 

Proof. Su has shown that for any probability distribution P on T*^: 

1 1/2 



(1) D{P) > sup 

rg(0,.5]'* 



E i^(h)i^n 



i=l 



if hi = 



where -P(h) is the Fourier transform of P, i.e., -P(h) = J^^ e'^^^^'^Qid^)- We will 
use this formula to bound D[Q*^) where Q is the generating measure of our random 
walk. Note that: 



27rih-(3, 



1 

1 " 

= — cos(27rh • dj). 

Since cos(2'7rx) = cos(27r{x}) > 1 — 27r^{x}^, we have 

1 " 

Q(h) > -5]]l-27r2{aj-h} 

i=i 

27r" 



> 1 



27r2 



{Ah}^ 



where A = {did2 ■ ■ ■ dn)~^ is the nxd matrix whose rows are the a/s. Also, noting 
that Q*^(h) = (5^(h) and that (1 — x)'^ > 1 — kx for k > 1 and x < 1, we have 
that 

-{Ahy 



Q*^(h) > 1 r /.U12 
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as long as 27c'^k{Ah}'^ /n < 1. This is ensured by setting Zi = 27r^/25 < 1 and let- 
ting q = (2Tx'^kd/ ZiY^"^ . Then Lemma 01 implies that there exists h G Z*^ such that 
< ||h||oo < g"/"^ and {Ah.}^ < l/q. This yields 2T:'^k{A\vY /n < 27r'^kd{Ah}l^ < 



2TT^kd/q^ = Zi < 1, as desired. (Note that \Q{h)\'' > 1 - Zi.] 
inequality at this h we find 



> sup 

rg(0,.5]'* 



(l-Zi)n| 

i=l ^ 



sin{2iThiri) 

2n 



if hi = 



Then, if we let = 1/Ahi < 1/2 if hi ^ and = l/27r if hi 
R{h) = ]^"^j^max{l, \hi\} to relate the size of h, we find that 

;i-zi)n 



> 



i=l 



iihi^O 
if hi = 



> 



> 



> 



> 



> 



> 



d 
oo 



7r^(27r2A;rf/Zi)"/2 



^d^n(jn/2 
1 



By evaluating 



and define 



□ 



2. Upper Bound 

We now seek an upper bound on the discrepancy of the random walk when our 
generators arise as rows of a badly approximable matrix. 

Definition 4. We say an n x d matrix A is badly approximable if there exists a 
constant Ca such that {A\y}oo > CA/||h||^" for all non-zero h G Z"'. We call Ca 
the approximation constant of A. 
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Note that Lemma El implies that for any matrix A (not just badly approximable 
ones , {Ah}oo < l/||h||:^" for infinitely many h G Z'^. Thus we say A is badly 
approximable if the reverse inequality holds (up to a constant Ca) for all h G Z*^. 
This definition closely follows Schmidt ^U]; who defines badly approximable linear 
forms; this corresponds to our definition by noting Ah is a linear form in the 
variables h^. 

As a subset of W^'^, the set of badly approximable matrices has Lebesgue measure 
zero ISj although their Hausdorff dimension is nd and there are uncountably many 
of them 

We now prove Theorem |21 

Proof. It is known [2] from Erdos, Turan, and Koksma that for all positive integers 



M, 



(2) 




Since | cos(27rx)| < 1 — 4{2a;} 



^ for all X G M, it follows that 






< 



1 - -{2Ahy 



n 



< 





0<\\h 



<M 
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Since M may be chosen freely, choose an integer M such that 
(3) M<U'J^X"'<M^1. 

Here Ca is an approximation constant for the badly approximable A and k is the 
number of steps in the walk. We can show: 



Lemma 5. With S and M defined as above, S < 



0.5 
M+l ■ 



Before proving this lemma, we show how the theorem follows. From inequal- 
ity 0, we find 

?,Y f 2 \ fiV f 2 0.5 \ f?,Y 2.5 



2J \M + l )~\^) \M^\ M+l) \2 J M + l 
From the inequality for M + 1 in Q , we have 



d / 2 \ n/2d /Q\d / \ n/d 



which concludes the proof of Theorem |21 □ 

All that remains is to prove Lemma 

Proof of Lemma\^ We shall bound S in three stages: (1) first, we group the terms 
of S into "cohorts" based on the size of ||h||oo, (2) we note that the points 2A][\ are 
bounded away from each other in T*^ and therefore can bound the terms within each 
cohort based on the size of {2y4h}oo, and (3) estimating the resulting expression. 

(1) Grouping the terms of S by the size of ||h||oo- Choose an integer J such that 

2-^-^ < M <2-^ -I. 
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The sum in S may be grouped into J cohorts of integers ||h||oo G Hj := [2^^^, 2^ — 1] 
for j = {1, . . . , J}. Therefore, 



R(h) 



< 



J 

< 



f^Sj Bounding the terms within each cohort. Within each cohort [2^~^, 2^ — 1], since 
h is a non-zero integral vector, the use of DefinitionlUyields {274h}oo > CA/||2h||^"' 
where Ca is the approximation constant of the matrix A. Therefore, each 2Ah is 
bounded away from any integral point by C^/||2h||J^". In fact, they are also 
bounded away from each other, since if ||hi||oo, ||h2||oo G [2"' ^ 2^ — 1] and hi 7^ h2, 
then ||hi - hsiloo < 2^+^ and 

{2Aih,-h,)U>—-^^--^> 



2'i/"||hi - h2||S^" ~ 2"'/"(2J'+^)'^/" 2(J'+2W'^' 

Therefore, we divide the unit cube [0, 1]" into subcubes of side-length Ca/ (2'^^-''*"^^/") 
and distribute the points {2y4h} throughout them. In the worst case, all of the 
points are distributed near the corners of the cube and occupy adjacent subcubes. 
Therefore, 



j=i i=i \ ^ 

< E2"*-'E(' + ir'-p(-;|Ji^)- 

j=i i=i ^ ^ 



2^ 
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(3) Estimating the resulting expression. Since M > 2-'-'^ and 

^ - 2{CAr ~ ^(Ca? " 2{CAf ' 
we can say that: 

J oo 

j=i i=i 

Since j < J, i > 1 and n > 1, the log derivative with respect to i of the inner 
sum can be bounded: 

^ - 4in(2^-^y'^/" < ^ - 4m < -4i < -4. 

Therefore, the expression in the inner sum decreases geometrically by at least 
the ratio e~^, and the inner sum can be bounded by the first term (at i — 1) times 
the constant 1/(1 — e~^). Therefore, 

S < ^ |!l^2-^xp (-2n22(^-^W-) 



< 



22n 



1 - e-4 



J]2-^exp(-2n22(-^-^)<^/"). 



This sum may be bounded by noting that the largest term occurs when j — 
J. For j < J, the log derivative of the terms with respect to j is ln2(— 1 + 
^22+2(J-j)d/n) > ln2(-l + 2^) = In 8. Therefore, the sum decreases geometrically 
with ratio at least 1/8 as j < J decreases, so the sum is bounded by seven-eighths 
the final term at j — J. Also, recalling that M < 2"^ — 1, 

92n 7 7 /9\2" 1 

S < ^-_V^exp(-2n22(^-^W") = — (-) — ^ 
- 1-6-^8 ^ 8{l-e-^)\eJ M + 1 

28 1 0.5 

< ^— < 



8(e2-e-2)M + l " M+l' 
as was to be shown. □ 

So, for badly approximable matrices A, we have the following discrepancy bounds 
on the associated random walk: 

C{n,d) C{n,d,A) 
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In general, these bounds do not match unless d = 1. In that case, we recover the 
same order of convergence as in 

We conjecture that in the lower bound of Theorem ^ (which applies to any 
random walk on the torus generated by a finite set of vectors) the A;"/^ may be 
improved to /c"/^*^ (for all matrices, not just badly approximable ones). This would 
yield matching upper and lower bounds for random walks on the (i-torus generated 
by the rows of a badly approximable matrix. This would confirm that such walks 
converge the fastest among all finitely-generated random walks on the (i-torus, 
a fact that has already been shown in j3] for dimension d = 1. Developing an 
approximation lemma similar to Dirichlet's Lemma (Lemma IH} that bounds -R(h) 
instead of ||h||oo may help in this regard. 
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